04:00
2026-06-17
arxiv.org
ai-safety
Pulling The REINS: Training-Free Safety Alignment of Video Diffusion Models via Representation Steering
Researchers introduced REINS, a training-free method that steers video diffusion models away from unsafe content at inference time by manipulating internal representations. The approach, which adds a …